''Markov Chains and Mixing Times'' is a book on

Markov chain mixing time In probability theory, the mixing time of a Markov chain is the time until the Markov chain is "close" to its steady state distribution. More precisely, a fundamental result about Markov chains is that a finite state irreducible aperiodic chain has ...

s. The second edition was written by David A. Levin, and

Yuval Peres Yuval Peres ( he, יובל פרס; born 5 October 1963) is a mathematician known for his research in probability theory, ergodic theory, mathematical analysis, theoretical computer science, and in particular for topics such as fractals and Hausdo ...

Elizabeth Wilmer Elizabeth Lee Wilmer is an American mathematician known for her work on Markov chain mixing times. She is a professor, and former department head, of mathematics at Oberlin College. As a 16-year-old high school student at Stuyvesant High School an ...

was a co-author on the first edition and is credited as a contributor to the second edition. The first edition was published in 2009 by the

American Mathematical Society The American Mathematical Society (AMS) is an association of professional mathematicians dedicated to the interests of mathematical research and scholarship, and serves the national and international community through its publications, meetings, ...

, with an expanded second edition in 2017.

Background

Markov chain A Markov chain or Markov process is a stochastic model describing a sequence of possible events in which the probability of each event depends only on the state attained in the previous event. Informally, this may be thought of as, "What happe ...

is a

stochastic process In probability theory and related fields, a stochastic () or random process is a mathematical object usually defined as a family of random variables. Stochastic processes are widely used as mathematical models of systems and phenomena that appea ...

defined by a set of states and, for each state, a probability distribution on the states. Starting from an initial state, it follows a sequence of states where each state in the sequence is chosen randomly from the distribution associated with the previous state. In that sense, it is "memoryless": each random choice depends only on the current state, and not on the past history of states. Under mild restrictions, a Markov chain with a finite set of states will have a

stationary distribution Stationary distribution may refer to: * A special distribution for a Markov chain such that if the chain starts with its stationary distribution, the marginal distribution of all states at any time will always be the stationary distribution. Assum ...

that it converges to, meaning that, after a sufficiently large number of steps, the probability of being in each state will close to that of the stationary distribution, regardless of the initial state or of the exact number of steps. The ''mixing time'' of a Markov chain is the number of steps needed for this convergence to happen, to a suitable degree of accuracy. A family of Markov chains is said to be ''rapidly mixing'' if the mixing time is a polynomial function of some size parameter of the Markov chain, and ''slowly mixing'' otherwise. This book is about finite Markov chains, their stationary distributions and mixing times, and methods for determining whether Markov chains are rapidly or slowly mixing. A classical and familiar example of this phenomenon involves shuffling decks of cards: starting from a non-random initial deck of cards, how many shuffles does it take to reach a nearly-

random permutation A random permutation is a random ordering of a set of objects, that is, a permutation-valued random variable. The use of random permutations is often fundamental to fields that use randomized algorithms such as coding theory, cryptography, and sim ...

? This can be modeled as a Markov chain whose states are orderings of the card deck and whose state-to-state transition probabilities are given by some mathematical model of random shuffling such as the

Gilbert–Shannon–Reeds model In the mathematics of shuffling playing cards, the Gilbert–Shannon–Reeds model is a probability distribution on riffle shuffle permutations that has been reported to be a good match for experimentally observed outcomes of human shuffling, and th ...

. In this situation, rapid mixing of the Markov chain means that one does not have to perform a huge number of shuffles to reach a sufficiently randomized state. Beyond card games, similar considerations arise in the behavior of the physical systems studied in

statistical mechanics In physics, statistical mechanics is a mathematical framework that applies statistical methods and probability theory to large assemblies of microscopic entities. It does not assume or postulate any natural laws, but explains the macroscopic be ...

, and of certain

randomized algorithm A randomized algorithm is an algorithm that employs a degree of randomness as part of its logic or procedure. The algorithm typically uses uniformly random bits as an auxiliary input to guide its behavior, in the hope of achieving good performan ...

Topics

The book is divided into two parts, the first more introductory and the second more advanced. After three chapters of introductory material on Markov chains, chapter four defines the ways of measuring the distance of a Markov chain to its stationary distribution and the time it takes to reach that distance. Chapter five describes

coupling A coupling is a device used to connect two shafts together at their ends for the purpose of transmitting power. The primary purpose of couplings is to join two pieces of rotating equipment while permitting some degree of misalignment or end mov ...

, one of the standard techniques of bounding mixing times. In this technique, one sets up two Markov chains, one starting from the given initial state and the other from the stationary distribution, with transitions that have the correct probabilities within each chain but are not independent from chain-to-chain, in such a way that the two chains become likely to move to the same states as each other. In this way, the mixing time can be bounded by the time for the two coupled chains to synchronize. Chapter six discusses a technique called "strong stationary times" with which, for some Markov chains, one can prove that choosing a stopping time randomly from a certain distribution will result in a state drawn from the stationary distribution. After a chapter on

lower bound In mathematics, particularly in order theory, an upper bound or majorant of a subset of some preordered set is an element of that is greater than or equal to every element of . Dually, a lower bound or minorant of is defined to be an element ...

s on mixing time based on the " bottleneck ratio" and

isoperimetric number In mathematics, the Cheeger constant (also Cheeger number or isoperimetric number) of a graph is a numerical measure of whether or not a graph has a "bottleneck". The Cheeger constant as a measure of "bottleneckedness" is of great interest in man ...

, the next two chapters of the first part cover two important examples: card shuffling and

random walk In mathematics, a random walk is a random process that describes a path that consists of a succession of random steps on some mathematical space. An elementary example of a random walk is the random walk on the integer number line \mathbb Z ...

s on

graphs Graph may refer to: Mathematics *Graph (discrete mathematics), a structure made of vertices and edges **Graph theory, the study of such graphs and their properties *Graph (topology), a topological space resembling a graph in the sense of discre ...

. Chapters 10 and 11 consider two more parameters closely related to the mixing time, the

hitting time In the study of stochastic processes in mathematics, a hitting time (or first hit time) is the first time at which a given process "hits" a given subset of the state space. Exit times and return times are also examples of hitting times. Definitions ...

at which the Markov chain first reaches a specified state, and the cover time at which it has first reached all states. They also discuss time-reversible Markov chains and their connection to electrical networks. The final chapter of this part discusses the connection between the

spectral gap In mathematics, the spectral gap is the difference between the moduli of the two largest eigenvalues of a matrix or operator; alternately, it is sometimes taken as the smallest non-zero eigenvalue. Various theorems relate this difference to othe ...

of a Markov chain and its mixing time. The second part of the book includes many more examples in which this theory has been applied, including the

Glauber dynamics In statistical physics, Glauber dynamics is a way to simulate the Ising model (a model of magnetism) on a computer. It is a type of Markov Chain Monte Carlo algorithm. The algorithm In the Ising model, we have say N particles that can spin up ...

on the

Ising model The Ising model () (or Lenz-Ising model or Ising-Lenz model), named after the physicists Ernst Ising and Wilhelm Lenz, is a mathematical model of ferromagnetism in statistical mechanics. The model consists of discrete variables that represent ...

, Markov models of

chromosomal rearrangement In genetics, a chromosomal rearrangement is a mutation that is a type of chromosome abnormality involving a change in the structure of the native chromosome. Such changes may involve several different classes of events, like deletions, duplicatio ...

, the

asymmetric simple exclusion process In probability theory, the asymmetric simple exclusion process (ASEP) is an interacting particle system introduced in 1970 by Frank Spitzer. Many articles have been published on it in the physics and mathematics literature since then, and it has ...

in which particles randomly jump to unoccupied adjacent spaces, and random walks in the

lamplighter group In mathematics, the lamplighter group ''L'' of group theory is the restricted wreath product \mathbf_2 \wr \mathbf Z Introduction The name of the group comes from viewing the group as acting on a doubly infinite sequence of street lamps \dots,l_, ...

. Topics covered in the second part of the book include more on spectral graphs and

expander graph In graph theory, an expander graph is a sparse graph that has strong connectivity properties, quantified using vertex, edge or spectral expansion. Expander constructions have spawned research in pure and applied mathematics, with several applicati ...

s, path coupling (in which a sequence of more than two Markov chains is coupled in pairs), connections between coupling and the

earth mover's distance In statistics, the earth mover's distance (EMD) is a measure of the distance between two probability distributions over a region ''D''. In mathematics, this is known as the Wasserstein metric. Informally, if the distributions are interpreted ...

, martingales, critical temperatures, the "cutoff effect" in which the probability distribution of the chain transitions rapidly between unmixed and mixed, the evolving set process (a derived Markov chain on sets of states of the given chain), Markov chains with infinitely many states, and Markov chains that operate in continuous time rather than by a discrete sequence of steps. A guest chapter by

Jim Propp James Gary Propp is a professor of mathematics at the University of Massachusetts Lowell. Education and career In high school, Propp was one of the national winners of the United States of America Mathematical Olympiad (USAMO), and an alumnus o ...

and David B. Wilson describes

coupling from the past Among Markov chain Monte Carlo (MCMC) algorithms, coupling from the past is a method for sampling from the stationary distribution of a Markov chain. Contrary to many MCMC algorithms, coupling from the past gives in principle a perfect sample from ...

, a method for obtaining samples drawn exactly from the stationary distribution rather than (as one obtains from

Markov chain Monte Carlo In statistics, Markov chain Monte Carlo (MCMC) methods comprise a class of algorithms for sampling from a probability distribution. By constructing a Markov chain that has the desired distribution as its equilibrium distribution, one can obtain ...

methods) approximations to this distribution. The final chapter collects open problems in this area.

Audience and reception

This book can be used either as a reference by researchers in areas using these methods, or as the basis for a graduate-level course, particularly one limited to the more introductory material in the first part of the book where only an undergraduate-level knowledge of

probability theory Probability theory is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expressing it through a set o ...

and

linear algebra Linear algebra is the branch of mathematics concerning linear equations such as: :a_1x_1+\cdots +a_nx_n=b, linear maps such as: :(x_1, \ldots, x_n) \mapsto a_1x_1+\cdots +a_nx_n, and their representations in vector spaces and through matrices. ...

is required. However, reviewer

Rick Durrett Richard Timothy Durrett is an American mathematician known for his research and books on mathematical probability theory, stochastic processes and their application to mathematical ecology and population genetics. Education and career He rece ...

suggests that the book's contents would be too advanced for undergraduate courses, even at research-level universities, and reviewer Takis Konstantopoulos suggests that the book's contents would be better appreciated by a reader who has already had some exposure to the material that it covers. Reviewer

Olle Häggström Olle Häggström (born 4 October 1967) is a professor of mathematical statistics at Chalmers University of Technology. Häggström earned his doctorate in 1994 at Chalmers University of Technology with Jeffrey Steif as supervisor. He became an as ...

calls the book "authoritative and highly readable". Reviewer H. M. Mai writes that its explanations are careful and "well motivated", and that the writing is "lucid and clear". Reviewer László Lakatos calls it "a brilliant guide to the modern theory of Markov chains". And reviewer

David Aldous David John Aldous FRS (born 13 July 1952) is a mathematician known for his research on probability theory and its applications, in particular in topics such as exchangeability, weak convergence, Markov chain mixing times, the continuum random ...

predicts that it "will long remain the definitive required reading" in this area.

References

{{reflist, refs= {{citation, last=Mai, first=H. M., journal=

zbMATH zbMATH Open, formerly Zentralblatt MATH, is a major reviewing service providing reviews and abstracts for articles in pure mathematics, pure and applied mathematics, produced by the Berlin office of FIZ Karlsruhe – Leibniz Institute for Informa ...

, title=Review of ''Markov Chains and Mixing Times'' (1st ed.), zbl=1160.60001 {{citation, last=Lakatos, first=László, journal=

, title=Review of ''Markov Chains and Mixing Times'' (2nd ed.), zbl=1390.60001 {{citation, last=Häggström, first=Olle, authorlink=Olle Häggström, journal=

Mathematical Reviews ''Mathematical Reviews'' is a journal published by the American Mathematical Society (AMS) that contains brief synopses, and in some cases evaluations, of many articles in mathematics, statistics, and theoretical computer science. The AMS also pu ...

, mr=2466937, title=Review of ''Markov Chains and Mixing Times'' (1st ed.), year=2010 {{citation, last=Aldous, first=David, authorlink=David Aldous, date=March 2019, doi=10.1007/s00283-018-9839-x, issue=1, journal=

The Mathematical Intelligencer ''The Mathematical Intelligencer'' is a mathematical journal published by Springer Verlag that aims at a conversational and scholarly tone, rather than the technical and specialist tone more common among academic journals. Volumes are released quar ...

, mr=3918079, pages=90–91, title=Review of ''Markov Chains and Mixing Times'' (2nd ed.), volume=41 {{citation, last=Konstantopoulos, first=Takis, doi=10.1137/19N974865, issue=3, journal=

SIAM Review Society for Industrial and Applied Mathematics (SIAM) is a professional society dedicated to applied mathematics, computational science, and data science through research, publications, and community. SIAM is the world's largest scientific socie ...

, mr=3989422, pages=631–634, title=Review of ''Markov Chains and Mixing Times'' (2nd ed.), volume=61, year=2019 {{citation, last=Durrett, first=Rick, authorlink=Rick Durrett, date=September 2019, journal=MAA Reviews, publisher=

Mathematical Association of America The Mathematical Association of America (MAA) is a professional society that focuses on mathematics accessible at the undergraduate level. Members include university, college, and high school teachers; graduate and undergraduate students; pure a ...

, title=Review of ''Markov Chains and Mixing Times'' (2nd ed.), url=https://www.maa.org/press/maa-reviews/markov-chains-and-mixing-times-0

External links

Author web site with errata and a downloadable copy of the book
Markov processes Markov chain Monte Carlo Mathematics books 2008 non-fiction books 2017 non-fiction books Publications of the American Mathematical Society